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Abstract — In this paper, we investigate the robust transceiver 
design for dual-hop amplify-and-forward (AF) MIMO relay 
systems with Gaussian distributed channel estimation errors. 
Aiming at maximizing the mutual information under imperfect 
channel state information (CSI), source precoder at source and 
forwarding matrix at the relay are jointly optimized. Using some 
elegant attributes of matrix-monotone functions, the structures of 
the optimal solutions are derived first. Then based on the derived 
structure an iterative waterfilling solution is proposed. Several 
existing algorithms are shown to be special cases of the proposed 
solution. Finally, the effectiveness of the proposed robust design 
is demonstrated by simulation results. 

I. Introduction 

Cooperative communication is one of the key parts of 
the future communication protocols, as the deployment of 
relays can improve link equality, extend coverage range and 
mitigate inference. In general, there are various relay strategies 
which are casted into three main categories, i.e., amplify-and- 
forward (AF), decode-and-forward (DF) and compressed-and- 
forward (CF). Among these relaying strategies, AF strategy 
which has the lowest complexity is most suitable for practical 
implementation. 

It is also well-established that adopting multiple antennas 
has a potential to improve overall wireless system perfor- 
mance. In order to reap both benefits promised by cooper- 
ative communication and multi-input multi-output (MIMO) 
systems, linear transceiver design for AF MIMO relaying 
systems has been widely researched in IT]— S. Generally, 
speaking there are two main kinds of criteria for transceiver 
design: capacity maximization and mean-square-error (MSE) 
minimization. Joint design of relay forwarding matrix and 
destination equalizer for minimizing MSE is discussed in fl] 
and 13. Furthermore, joint design of source precoder relay for- 
warding matrix and destination equalizer for minimizing MSE 
is investigated in 13. The capacity maximization transceiver 
design has been discussed in 0, 0. 

Q In most of previous works, channel state information (CSI) 
is assumed to be perfectly known. However, this assumption 
cannot be met in practice. Channel estimation errors are always 
inevitable and drastically degrades system performance. It 
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is well known that robust designs can reduce or mitigate 
the negative effects introduced by imperfect CSI. This is 
also the motivation of our work. In this paper, we jointly 
optimize source precoder matrix and relay forwarding matrix 
for mutual information maximization under channel estimation 
errors. Based on the properties of matrix-monotone functions, 
the optimal structure of robust transceivers is derived. Then, 
an iterative water-filling solution is proposed. Finally, the 
numerical result shows the performance advantage of the 
proposed robust design. 

The following notations are used throughout this paper. 
Boldface lowercase letters denote vectors, while boldface 
uppercase letters denote matrices. The notation Z H denotes 
the Hermitian of the matrix Z, and Tr(Z) is the trace of the 
manix Z. The notation Z 1 / 2 is the Hermitian square root of the 
positive semi-definite matrix Z, such that Z 1 / 2 Z 1 / 2 = Z and 
Z 1 / 2 is also a Hermitian matrix. For a rectangular diagonal 
matrix A, A \ denotes the main diagonal elements are in 
decreasing order and A /*■ denotes the main diagonal elements 
are in increasing order. For two Hermitian matrices, C > D 
means that C — D is a positive semi-definite matrix. The 
symbol A^(Z) represents the i th largest eigenvalue of Z. 

II. System Model and Problem Formulation 

A. Transmitted and Received Signals 

In our work, a dual-hop AF MIMO relay system is inves- 
tigated, in which there is one source with N$ antennas, one 
relay with Mr receive antennas and Nr transmit antennas, and 
one destination with Mb antennas. Because of deep fading, 
the direct link between the source and destination is not taken 
into account. At the first hop, the source transmits data to the 
relay. The received signal at the relay is denoted as 

x = H sr Ps + ni (1) 

where H sr is the MIMO channel matrix between the source 
and the relay, and P is the precoder matrix at the source. The 
vector s is the x 1 data vector transmitted by the source with 
the covariance matrix R s — E{ss H } = 1^. Furthermore, ni 
is the additive Gaussian noise vector with correlation matrix 

At the relay, the received signal x is multiplied by a 
forwarding matrix F. Then the resultant signal is transmitted 



to the destination. The received signal y at the destination can 
be written as 

y = H rd FH sr Ps + H rd Fm + n 2 , (2) 

where H r <j is the MIMO channel matrix between the relay and 
the destination, and n 2 is the additive Gaussian noise vector 
at the second hop with covariance matrix R„ 2 = cr 2 2 Im d ■ In 
order to guarantee the transmitted data s can be recovered at 
the destination, it is assumed that Ng, Mr, Nr, and Mq are 
greater than or equal to N |3l. 

When channel estimation errors are taken into account, the 
dual-hop channels read as 

H sr = H sr + AH sr , H r d = H r d + AH rc j, (3) 

where H sr and H r d are the channel estimates and AH sr and 
AH r( i are the corresponding estimation errors with zero-mean 
Gaussian distributed entries. Additionally, the estimation errors 
are independent with each other as the channels are separately 
estimated. Referring to estimation errors, the following widely 
used Kronecker structure is adopted J6) ifTTIl 

AH.,,,, = sV 2 H Wj5r *V 2 AH rd = £^ 2 H Wird *^ 2 , (4) 

where the entries of Hw.sr and Hw.rd are identical and 
independent distributed (i.i.d.) with zero mean and unit vari- 
ance. The column correlation matrices (*& sr and \& re j) and 
the row correlation matrices (S sr and E r <j) are determined 
by training sequences and channel estimators JS). To the best 
of the authors' knowledge, for a general case even for a point- 
to-point MIMO system, there is no closed-form solution. In 
this paper, we focus on the case with column correlations only 
i.e., 

S sr = ail, S rd = a 2 I, (5) 

as this case corresponds to a practical linear minimum mean 
square error (LMMSE) channel estimator |8). 

B. Problem Formulation 

At the destination, a linear equalizer G is adopted to detect 
the data vector s. The mean-square-error (MSE) matrix is 
E{(Gy — s)(Gy — s) H } , where the expectation is taken with 
respect to random data, channel estimation errors, and noise. 
In 0, it is shown that 

E{(Gy-s)(Gy-s) H } 

= G(H rd FR x F H H« + K 2 )G H + I N 

- (P H H»F H H=G H ) - (GH r dFH sr P), (6) 

where matrices R x and K 2 are defined as 

R x 4 E{xx H } = H sr PP H H^ + Ki 

Ki 4 Tr(PP H * sr )£ sr + R ni 

K 2 4 Tr(FR x F H * rrf )£ r(i + R„ 2 . (7) 

It is obvious that R x is the covariance matrix of the received 
signal at the relay. Using linear Bayesian theory, the LMMSE 
equalizer at the destination equals to 

G = (H rd FH sr P) H (H. rd FR x F H H^ + K 2 )-\ (8) 



based on which the MSE matrix in ® is rewritten as 

*mse(F,P) 
=1 - (H rd FH sr P) H (H rd FR x F H H^ + K^ 1 
x (H rd FH sr P) (9) 

Capacity is one of the most important and widely used per- 
formance metrics for transceiver designs. Given the received 
pilots in channel estimation as yi and y 2 , the channel capacity 
is denoted as I(s; y|yi, y 2 ), which is the conditional mutual 
information based on known y x and y 2 ifTTI . To the best of 
our knowledge, the exact capacity for MIMO channels with 
estimation errors at both ends is largely open even for point- 
to-point MIMO systems lfTTll . To proceed, a common logic is 
to derive and use bounds i.e., lower bound or upper bound. 
Since we aim to maximize channel capacity, lower bound is 
more meaningful than upper bound. In Appendix|A]it has been 
proved that 

-log|*MS E (F,P)| </(s;y|yi,y 2 ). (10) 

This is a widely-established lower bound HI 1 II and becomes 
tighter as estimation errors are smaller. 

As a result, the robust transceiver design for maximizing 
mutual information is formulated as 

min log|*MSE(F,P)| 

s.t. Tr(FR x F H ) < P r Tr(PP) < P s . (11) 

Based on the definition of R x in ((T), R x is a function of P. 
In order to simplify the analysis, we define a new variable 

F 4 FKl /2 (K7 1/2 H sr PP H H^,K7 1/2 + I) 1 / 2 , (12) 

S v ' 

based on which FR X F H = FF H and the two constraints 
involved in (fTTT i become independent. Meanwhile, the MSE 
matrix in (O is correspondingly rewritten as 

*mse(F,P) = I - (H r£i Fn p 1/2 K- 1/2 H s? .P) H (H rci FF H 
xfiH + K 2 )- 1 (H rd Fn p 1/2 K- 1/2 H sr P). (13) 

Finally, the optimization problem for the robust design be- 
comes 

min log|*MSE(F,P)| 

F,P 

s.t. Tr(FF H ) < P r Tr(PP) < P s . (14) 

In the following, the optimal solutions of < TT~4T > will be discussed 
in details. 

III. Optimal Solutions 

In our work, we investigate the optimization problem dT4b 
from matrix-monotone function viewpoint. The idea of uti- 
lizing the properties of matrix-monotone functions to design 
MIMO transceivers has been address in iflOl . In this paper, we 
extend this idea to robust transceiver designs for a AF MIMO 
relaying system. 



Definition 1: A matrix-monotone function is defined as g(») 
which maps a matrix variable from a subsect of positive semi- 
definite matrices to a real number. If <;(•) is a monotonically 
decreasing matrix-monotone function on positive semi-definite 
matrices, it satisfies 



A y B h -> g(A) < g(B). 



(15) 



On the other hand, when </(•) a monotonically increasing 
matrix-monotone, it means — g(u) is a monotonically decreas- 
ing matrix-monotone function ifTUl . 

In the following we focus our attention on a kind of 
optimization problems with a decreasing matrix-monotone 
function as objective, which is formulated as 



mm 
x 



X H H H HX— 

Tr(XX H ) <P r] x = Tr(XX H *)a + cr 2 . (16) 

Solving the optimization problem ( fToT ), two important solu- 
tions are derived and are the basis for the following deriva- 
tions. 

Conclusion 1: The optimal solution of ( [Tol l satisfies 

Tr(XX H ) = Tr[XX H (aP* + (J 2 n l)}/r} x = P. (17) 

Defining the unitary matrix Vh and rectangular diagonal ma- 
trix Ah based on the following singular value decomposition 

H(aP* + all)' 1 / 2 = UhA h V h with A H \, (18) 

the optimal solution of the optimization problem ( fToT ) has 
following structure 



1 



s.t. 



-(aP* + e^I)-V a V H A x Ul 



-A-opt " 

with AxAjjA H A x \, 

and r, x = a 2 J{\ ~ aTr[Vg(aP* + «7*I)-V a 
x *K* + ^I)-V 2 VhAxAS}, 



(19) 



where Uh is an unitary matrix and Ax is a rectangular 
diagonal matrix with real diagonal elements. 

Proof: See Appendix [B] ■ 
Conclusion 2: For a complex matrix A and a positive define 
matrix N, based on following eigen-decomposition 

AN -1 A H = UanaA A naV£ na with A A na \ (20) 

when the objective function is 



g (X H H H HX/»fe) - log|A H (X H H H HX/^ + I)- 

the unitary matrix Uh in <fl~9b equals to 
U= = Uana- 

Defining N x = min{Rank(H H H), Rank(AA H )}, A x in 
([T9T l has the structure of 



A + N| 
(21) 

(22) 



A x 









(23) 



where A x is a Nx X Nx diagonal matrix. 
Proof: See Appendix ICl ■ 



A. The structure of optimal F 

In this section the structure of optimal F is derived. Based 
on the matrix inversion lemma, the MSE matrix in ( fT3l can 
be rewritten as 

*mse(F,P) 

=(n p 1/2 K- 1/2 H sr P) H (F H fiH K^H rd F + 1)" 1 
x (n p 1/2 K7 1/2 H sr P) + (P H fiH K^ x H sr P + I)- 1 



=A F 



=N F 



based on which for any given P the optimization problem with 
respect to F becomes as 

min loglA^^H^K^H^F + P^Ap + N P | 

F 

s.t. Tr(FF H ) < P r K 2 = Tr(FF H * rd )S rd + a 2 J. 

(24) 

Defining unitary matrices Ui and Vi based on the follow- 
ing singular value decomposition 

K~ 1/2 H sr P = U1A1V? with Ai \, 



A P Np ! A? 



(25) 

we have the following eigen-decomposition 

Lp = W1L11 li-lAj \ . (26) 

Together with the following singular value decomposition 

U rd {a 2 P r ^rd + alX 1 ' 2 = U rd A rd V? d with A rd \, 

and based on Conclusions 1 and 20, the optimal F has the 
following structure 



a 2 P r * rd + a 2 ,I)- 1 / 2 V I , diAr A F UH 



' =VVf 

~ 2 - 

with A F A rd A F \ 



N 



and n f = a 2 n J{\ - a 2 Tr[V5 N {a»P r 9rd + <I)~ 1/2 



x ^ rd (a 2 P r ^ rd + a 2 2 I)- 1 / 2 V rd , 7V A|]}, (27) 

where A F is a N x N diagonal matrix. Meanwhile, for the 
optimal F the following constraint is fulfilled 

Tr(FF H ) = Tr[FF H (a 2 P r * rd + <I)]A?/ = P r . (28) 

B. The structure of optimal P 

In the following, it will be proved that given the structure 
of F the optimization problem for P is the same as that for 
F. Using the optimal structure of F, we have 



r 7 2 - 



N 



(F^K^H^F + I)- 1 = (U ljAr A F A rd A F U^ 
Using the following substitution 

A 2 4(A f Aj d A f +I)"V, 

and the matrix inversion lemma again, the MSE matrix can be 
reformulated as 



(29) 



*mse(F, P) =Vi(I - A 2 ) 1/2 Vf (P H Hf r Kr 1 H sr P + I) 
x Vi(I- A 2 ) 1/2 Vf + ViA 2 Vf . 



(30) 



1 Notice that S r 



Therefore, the optimization problem with respective to P is 
equivalent to 

mm log|A?(P H fiH K^H^P + 1)"^ + N p | 

s.t. Tr(PP H ) < P s , Kx = Tr(PP H * sr )£ sr + a 2 ni I. 

(31) 

Based on the definitions of A F and N F , it can be derived that 
AsN^ 1 A9 has the following eigen-decomposition 

" F F 

Ap N- : A| = VxApA^ApV" (32) 
Together with following singular value decomposition, 



H sr (ci!iP s * sr + al 1 I) 



-1/2 _ 



U sr A sr V* 



(33) 



and using Conclusions 1 and 20, the optimal P has the 
following structure 

P = ^r p ( ai P s y sr + al\)- l ' 2 Y srtN k P Vf 

- 2 - 

with A P A sr Ap \ 

and Vp = alj{\ - ai Tr[V^ N ( ai P s * sr + <I)- 1/2 
x * sr ( ai P s * sr + all)-^ 2 V srtN A 2 p }}, (34) 

where Ap a N x N diagonal matrix. Considering that as there 
are no constraints on Vi, Vi can be an arbitrary NxN unitary 
matrix. Based on Conclusion 1, it can be concluded that the 
optimal P satisfies 

Tr(PP H ) = Tr[PP H ( ai P s * sr + <J 2 n I)]/ Vp = P s . (35) 

Substituting d34l i into i25i . it can be derived that Ui jv = 
XJ sr ,N and then the optimal structure of F is 



F = VW(«2P-*,. d + <I)- 1/2 V 



rd,NA-f*-> srN . (36) 



Based on the optimal structure given by ( 1341 ) and Q6j and 
with regard to the fact that ?y p and r/f are determined by Ap 
and A F , respectively, the left unknown variables are only Ap 
and Ap. 

C. Proposed Solutions for A F and Ap 

Based on d28i > and (13 St . the optimization problem (fl4l also 
equals to 

min log|*MSE(F,P)| 



F,P 



s.t. Tr[FF H (a 2 F r * rd + a^I)]/^ = P r 
Tr[PP H ( ai P s * sr + <I)]/r? p = P s . 

Furthermore, with the following diagonal matrices 

A sr = diag{A sr!i } A rd = diag{A rd ^} 
A F = diag{/,} A P = diag{p,} 



(37) 



(38) 



and substituting (|34l and d36] l into d37t . the optimization 
problem (|37| | can be rewritten as 



N 



log- 



fi Kd,i + Pi \r,( + 1 



With respective to the fact the problem (|39t is inherently non- 
convex and difficult to solve, an iterative water-filling solution 
is proposed in this paper. When p,'s are fixed, fi's can be 
computed as 



piKr,i + \ (plKr, i) 2 + 



V 



\ 2 

A rd,i 



J 



(40) 

where fif > is the Lagrange multiplier which makes 
Y^ifi = Pr- On the other hand, when fi's are fixed p^'s 
can be computed as 



2 

Pi = 



(/ML,) 2 



+ 



2A 



V 



\ 2 

A sr,i 



/ 



(41) 

where /i p > is the Lagrange multiplier which makes 

Special cases: Several existing algorithms can be considered 
as special cases of our proposed solution. 

• When CSI is perfectly known and P — I, the proposed 
solution for F reduces to that in J2). 

• When CSI is perfectly known, the proposed solution for P 
and F reduces to that given in 15). 

• When the second hop channel is an identity matrix and 
noiseless, the proposed solution for source precoder design 
reduces to that given in ifTTI . 

IV. Simulation Results and Discussions 

In this section, simulation results are presented to assess the 
performance of the proposed algorithm and for the purpose 
of comparison, the algorithm based on the estimated channel 
only (without taking the channel errors into account) |5j. In the 
following, we consider an AF MIMO relay system where the 
source, relay and destination are equipped with same number 
of antennas, i.e., N s = Mr = N R = AI D = 4. The 
channels H sr and H rc j are randomly generated according to 
i.i.d. Gaussian distribution. 

To estimate the channels, a practical LMMSE estimation 
algorithm is adopted O. For the training sequence D, a 
famous exponential correlation matrix is used to describe the 
correlation matrix of D, i.e., DD H oc R p where [Rpjy = 
pl^L As a result, S sr = H rd = I and <J> sr = *S? rd = 
(I4 + SNRestRq) 1 where SNRest is the signal-to-noise 
ratio (SNR) in channel estimation process |80. 

In the simulation, for data transmission process the SNR 



3 Notice that S sr = oil. 



4 The detailed derivation is given in (8] 




5 10 15 20 25 30 

SNR (dB) 



Fig. 1. Sum rates of different algorithms when p = 0.5. 

at relay is defined as P s /ai 2 , and the SNR at destination 
is defined as P r ja\. For simplicity, it is also assumed that 
P s /(Ji 2 = P r ja\. Each point in the following figure is an 
average of 10000 independent channel realizations. 

Fig. [TJ shows the sum rates of different algorithms including 
the proposed robust design and its counterpart based on 
estimated CSI only when p = 0.5. It can be seen that 
the performance of the proposed robust design is always 
better than that of the design based estimated on CSI only. 
Furthermore, as the channel estimation SNR decreases the 
performance gain of the robust design becomes larger. 

V. Conclusions 

Robust mutual information maximization transceiver design 
for dual-hop AF MIMO relay systems was investigated. With 
Gaussian distributed channel errors the precoder at the source 
and forwarding matrix at the relay were jointly designed. The 
structures of the optimal solutions were derived first, which 
differentiates our work from the existing works. For the un- 
known diagonal matrices, a well-known iterative water-filling 
solution was proposed. The simulation result demonstrated the 
performance advantage of our robust design. 

Appendix A 
Lower Bound of Capacity 

Denoting yi and y 2 as the received pilots in the separate 
dual hop channel estimations, the capacity between the source 
and destination equals to 

I(s;y|yi,y 2 ) =H(s|yi 1 y2)-H(s|y,y 1 ,y 2 ) (42) 

=H(s) 

where H(s|yi,y 2 ) denotes the conditional entropy of s when 
yi and y 2 are known and H(s|y,yi,y2) is the conditional 
entropy of s when y, yi and y 2 are known. As yi and y 2 are 
independent with s, H(s|yi,y2) = H(s). The second term on 
the right hand side of (l42l will be discussed in the following. 

Denoting s = E| y yi , y2 {s} as the conditional mean and 
based on the definition of covariance matrix, the conditional 
covariance matrix satisfies 

Cov(s| y , yi ,y 2 )=E{(s-s)(s-s) H } 

d E{(s - s)(s - s) H } + E{(s - s)(s - s) H 

(43) 



where s is an arbitrary estimate of s including LMMSE 
estimate. It should be pointed out that as CSI is not perfectly 
known, § ^ s. Notice that s is the conditional mean and thus 
we have 

E{(s-s)(s-s) H } =0 (44) 
based on which the right hand side of (l43l also equals to 

E{(s - s)(s - s) H } + E{(s - s)(s - s) H } 
= E{(s - s + s - s)(s - s + s - s) H } 
= E{(s-s)(s-s) H }. (45) 

Substituting (l45T l into (1431 . we have 

Cov(s|y, yi ,y 2 ) ^E{(s-s)(s-s) H } = # MS e(F,P). 

(46) 

It is also well-known that with fixed covariance matrix, Gaus- 
sian distribution has the maximum entropy. Therefore, it is 
concluded that 

H(s|y,yi,y 2 ) < E{log|7reCov(s|y, yi, y 2 )|} 

<log|7re* MSE (F,P)|, (47) 

based on which, an lower bound of the capacity ( l42l is 

J(y;s|yi,y 2 ) 

> H(s) -log|^e*MSE(F,P)| = -Iog|*MSE(F,P)|. (48) 

The final equality comes from the fact R s = I. 

Appendix B 
Proof of Conclusion 1 

As £/(•) is a matrix monotonically decreasing function, it 
can be proven that for the optimal solution the power constraint 
is always active, i.e., Tr(XX H ) = P 1101 , As a result, we have 
the following relationship. 

i lx = aTr(XX H *) + o* 




=i 



= Tr (XX H (aP* + all) )/P. (49) 

From d49l ), the constraint of the optimization problem ( fT6l ) 
equals to 

Tr(XX H ) = Tr[XX H (aP* + o^I)]/^ = P, (50) 
based on which the optimization problem ( fl6l ) is equivalent to 

min g ( X H H H H— X ) 

s.t. Tr[XX H (aP* + a 2 n I)]/n x = P. (51) 
Then, defining a new variable 

X = l/V^(aP* + ^i)V2 X) (52) 
the optimization problem d5Tb is further reformulated as 
min g (X H (aP* + cr^I)- 1 / 2 H H H(aP* + c^rr^x) 
s.t. Tr(XX H ) = P. (53) 



For any given X, based on the following singular decom- 
positions 

H(<xP* + all)~ 1/2 ± = V H A 3 U| with A H \ 
H(aP* + all)- 1 ' 2 = U H A H Vg with A H \, (54) 



there exists a matrix X satisfying 

X = v h a x u1 

with 1/6A H A X = A H 



(55) 
(56) 



where Ax is a diagonal matrix with the same rank as A= 
and b is a scalar which makes Tr(XX H ) = P hold. Based on 
Lemma 12 in lfT2l . the following inequality holds 

X H (aP* + ^I)- 1 / 2 H H H(aP* + o*T)-V a X 
hX H (aP* + <7 2 I)- 1/2 H H H(aP* + cr 2 I)" 1/2 X. (57) 

Together with the fact that </(•) is a matrix monotonically 
decreasing function, the following inequality holds 

9 (X H (aP* + a 2 I)- 1 / 2 H H H(aP* + a*!)- 1 / 3 *.) < 
g(X H (aP* + cr 2 I)- 1 / 2 H H H(aP* + al\)- l / 2 ±). (58) 

Therefore, it is concluded that the optimal X has the structure 
given by ((55). Furthermore, based on the definition of X d52l . 
the optimal X has the following structure 

X opt = + a 2 I)- 1 / 2 V H A x U| (59) 



positive semi-definite matrices M and TV i.e., \M + N\ > 
]1(A 4 (M) + Xi(N)) HI, we directly have 

log|A H (X H H H HX/V + I)- 1 A + N| > 

log|N| + y log[Aj(AN _1 A H ) + A ? (X H H H HX/ ??;c + I)] 



^log[A 4 (X H H H HX/r ? ,+I)] 



(63) 



Together with the optimal structure given by Conclusion 1, in 
order to make the equality in (l63l hold the following equation 
holds 



Us = Uana- 



(64) 



In light of the fact that power is never loaded to the eigen- 
channels with zero magnitudes 1121 . the diagonal matrix A x 
has the following structure 



A x 




(65) 



where Ax is a diagonal matrix. In (1591 r\ x is unknown either. 
In order to solve r\ x , substitute the structure of X in (l59l into 
the definition of r\ x in dl6) . and then we get the following 
equation 

Tfe =Tr(XX H *)a + a, 2 

= r fc aTr[V x (aP* + ( x 2 I)^ 1 / 2 *(aP* + a 2 I)" 1/2 
x V X A X A X ] + al . (60) 

This is a simple linear function of r\ x , and r\ x can be easily 
solved to be 

Vx = a 2 /{l - aTr[V x (aP* + a 2 I)- 1 / 2 *(«P* + a 2 n l)- l/2 
x V X A X A X ]}. (61) 

Appendix C 
Proof of Conclusion 2 

The objective function in (fJT) can be reformulated as 

log|A H (X H H H HX/? ?;c + I)" 1 A + N| 
=log|N||A H (X H H H HX/r7 x +I)~ 1 Ar>T 1 +I| 
=log|N| + log|(X H H H HX/?; I . + I) _1 AN _1 A H + I| 
=log|N| + loglAN^A 11 + (X H H H HX/^ + I)| 

-log|(X H H H HX/V + I)| (62) 

where the second equality is based on the fact that \AB + 
1 1 = \BA + I|. Using the matrix inequality that for two 



where A x is a Nx X Nx diagonal matrix and Nx = 
min{Rank(H H H), Rank(AA H )}. 
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